Neural network prediction of geomagnetic activity:
a method using local Hölder exponents

Z. Vörös and D. Jankovičová
Geomagnetic Observatory Hurbanovo, Geophysical Institute SAS,
Slovak Republic

Abstract 

Local scaling and singularity properties of solar wind and geomagnetic time
series were analysed using Hölder exponents α. It was shown that, in the
analysed cases, α changes from point to point due to the multifractality of
fluctuations. We argue that there exists a peculiar interplay between
regularity / irregularity and amplitude characteristics of fluctuations which
could be exploited to improve predictions of geomagnetic activity. To this
end, a layered backpropagation artificial neural network model with a feedback
connection was used to study the solar wind - magnetosphere coupling and to
predict the geomagnetic Dst index. The solar wind input was taken from a
principal component analysis of the interplanetary magnetic field, proton
density and bulk velocity. Superior network performance was achieved in cases
when the information on local Hölder exponents was added to the input layer.

1 Introduction 

One of the goals of solar-terrestrial physics is to predict the response of the
magnetosphere-ionosphere system to highly variable conditions in the solar wind
(SW). The question of solar wind-magnetosphere coupling (SWMC) can be studied
by means of input-output modelling. Linear input-output techniques (or linear
prediction filtering) describe the SWMC by a linear moving-average (MA) filter,
assuming that the convolution of a time-invariant transfer function (TF) with
an earlier SW input can predict the magnetospheric output represented by time
series of geomagnetic indices [Iyemori et al.(1979), Bargatze et al.(1985),
McPherron et al.(1988)]. The TF characterizes the magnetospheric response and
can be estimated directly from data provided that a sufficiently large number
of input-output pairs is available. In fact, [Bargatze et al.(1985)], using the
vB_s - AL input-output data (v - solar wind velocity, B_s - interplanetary
magnetic field N-S component, AL - auroral zone geomagnetic index), have shown
that linear MA filters can identify two different regimes in which SW energy is
dissipated within the magnetosphere (directly driven and loading-unloading
regimes). At the same time, the best linear MA filters do not predict the
geomagnetic output precisely unless strongly varying filter parameters are
considered separately for each level of activity [Blanchard and
McPherron(1994)]. Different levels of geomagnetic activity and the nonlinearity
of the SWMC were then treated by nonlinear MA filters [Price et al.(1994),
Vassiliadis et al.(1995)] using the assumption that the geomagnetic activity is
a nonlinear function of the solar wind input. Actually, local linear (that is,
nonlinear) MA filters were used, which represent a linear approximation of the
nonlinear system. Nonlinear MA filters proved to be better predictors of the
geomagnetic response than the linear ones, but the internal dynamics of the
magnetosphere and its additional influence on the geomagnetic response itself
(a feedback) was more explicitly considered within the frame of state-input
space models [Vassiliadis et al.(1995)]. Here the prediction of magnetospheric
states is made within a common input (solar wind) - output (geomagnetic data)
phase space and the local linear (nonlinear) approximation is given by an
evolution of the nearest neighbours of a phase space point. [Vassiliadis et
al.(1995)] found that, in comparison with linear state-input models (global
approximation), the nonlinear state-input models (local approximation based on
nearest neighbours) give better predictions of geomagnetic activity.
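
The linear prediction filtering idea described above can be illustrated with a
minimal Python sketch. The function name `ma_filter_predict`, the filter
coefficients and the input samples are hypothetical and are not taken from the
cited works; in practice the transfer function would be estimated from a large
number of input-output pairs.

```python
def ma_filter_predict(transfer, sw_input):
    """Causal moving-average filter: O(t) = sum_k H[k] * I(t - k).

    transfer : filter coefficients H[0..K-1] (hypothetical values)
    sw_input : solar wind input samples I(t), e.g. v*B_s
    The first K-1 outputs are None because the input history is too short.
    """
    K = len(transfer)
    out = [None] * len(sw_input)
    for t in range(K - 1, len(sw_input)):
        # Convolution of the transfer function with the earlier input
        out[t] = sum(transfer[k] * sw_input[t - k] for k in range(K))
    return out


# Illustrative values only
H = [0.5, 0.3, 0.2]
I = [0.0, 1.0, 0.0, 0.0, 2.0]
predicted = ma_filter_predict(H, I)
```

A local linear (nonlinear) filter would use the same convolution, but with
coefficients that depend on the current activity level.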

An alternative to the above MA filters is represented by artificial neural
networks (ANN), which are global nonlinear functions. An Elman recurrent ANN
was used by [Munsami(2000)] to model SW forcing of the westward auroral
electrojet and the storm-time ring current. In predicting geomagnetic activity
their performance was similar to that of linear filters [Hernandez et
al.(1993)]. Significantly better performance was achieved by gated ANNs that
accounted for different levels of activity. [Weigel et al.(1999)] used three
individual ANNs for modelling low, medium and high vB_s, AL activity levels
using data from the database of [Bargatze et al.(1985)]. The outputs of these
ANNs together with past geomagnetic outputs were used to train the gate
network. It was shown by [Weigel et al.(1999)] and [Weigel(2000)] that the
gated architecture gives significantly better predictions than the ungated one
or the ARMA system reported by [Hernandez et al.(1993)]. Obviously, the gated
ANN architecture resembles the state-input space model of [Vassiliadis et
al.(1995)], giving account for changing activity levels. Local linear filters
can be calculated in a neighbourhood of any point in state-input space; the
gated ANN, however, uses only three levels of activity.

In this paper we propose a method which allows the changing level of SW
fluctuations to be considered. Instead of building a more structured gated ANN
architecture, we use the extra information on local scaling characteristics of
a properly introduced measure, which can be estimated directly from a time
series. Multifractals exhibit time-dependent scaling laws and hence allow a
description of irregular phenomena that are localized in time. Multifractal
scaling characteristics of geomagnetic fluctuations were studied by [Consolini
et al.(1996)] and [Vörös(2000)]. [Jankovičová et al.(2001)], using a multilayer
feed-forward ANN, have shown that the information on multifractal
characteristics of geomagnetic data put to the input enhanced the performance
of their ANN in reconstructing AE-index time series from geomagnetic
observatory data. The inclusion of multifractality, however, somewhat amplified
the noise component in this case. We expect that the inclusion of the scaling
characteristics of solar wind and geomagnetic fluctuations in the ANN modelling
of SWMC offers a way of considering essential local information on rapid
changes, irregularities and intermittence, which has not been considered
sufficiently hitherto. Intermittence of SW and geomagnetic fluctuations has not
been built into nonlinear filter or ANN models, even though SW fluctuations
have proved to be strongly intermittent [Burlaga(1991), Carbone(1994), Marsch
et al.(1996), Tu et al.(1996), Bruno et al.(1999)] and both nonlinear
magnetotail theories [Chang(1999), Chapman et al.(1998), Klimas et al.(2000)]
and experimental works [Consolini et al.(1996), Borovsky et al.(1997),
Consolini and De Michelis(1998), Consolini and Lui(1999), Vörös(2000), Kovács
et al.(2001), Watkins et al.(2001)] predict or confirm the presence of
scalings, multifractality and intermittence within the magnetosphere. Though
there exist competing theoretical concepts regarding the underlying physical
mechanisms which may or may not produce the observed scalings [Freeman et
al.(2000), Antoni et al.(2001)], these considerations have no effect on our
analysis. We simply ask what the scaling characteristics of fluctuations are
and how this information can improve our ability to predict geomagnetic
activity using ANNs.



2 Data analysis methods 

2.1 Local scaling characteristics: the Hölder exponents

We consider the accumulated amount of signal energy within a window
W: (t_i - W, t_i). The signal energy E within a window W is computed as a sum
of the squared amplitudes of the time series through

    E(t_i, W) = \sum_{j=i-W}^{i} X^2(t_j),    i = 1, 2, ..., N    (1)

and

    \sum_{i=1}^{N} X^2(t_i) = 1    (2)

where X(t_i) represents a time series and N is the total number of data points.
The distribution of E in time is considered to be a measure which may also
appear as singular. Mathematically, a measure can be characterized by its
density. An erratic behaviour appears in the absence of a density for a
singular measure. Generally, singular distributions can be characterized
locally by the so-called singularity or Hölder exponents α [Halsey et
al.(1986), Muzy et al.(1994), Vehel and Vojak(1998)]. Loosely speaking, the
exponent α quantifies the degree of regularity or irregularity (singularity) of
a distribution or a function at a point t_i. Usually, the measure E(t_i, W)
within a window W scales as W^α. Therefore, α can be estimated by a regression
method using

    \alpha(t_i) \sim \frac{\log E(t_i, W)}{\log W}    (3)

taking different window lengths W. For a monofractal α(t_i) = const for all
t_i, while in the case of a multifractal measure (non-uniform distribution) α
changes from point to point (non-stationarity). For instance, fractional
Brownian motion or continuous Itô processes represent self-affine fluctuations
governed by a single Hölder exponent. The global distribution of singularity
exponents α for geomagnetic fluctuations was studied by [Consolini et
al.(1996)] and [Vörös(2000)]. It was shown that on the time scale of substorms
and storms geomagnetic fluctuations seem to be analogous to the simple
multiplicative p-model which describes energy cascade processes in turbulent
flows. This model explains how a specific energy flux introduced on large
scales to a flow can lead to non-homogeneous, intermittent energy distributions
on small scales. On this basis we expect that in the case of a homogeneous
energy transfer rate between scales with no intermittency effects, the above
defined distribution will be stationary and α(t_i) ~ 1 for all t_i. Otherwise,
α(t_i) < 1 indicates irregularities, sharp variations around t_i, while
α(t_i) > 1 is found in regions where events are more regular [Riedi and
Vehel(1997)]. In the case of multifractal processes α changes from point to
point, which usually makes the numerical estimation of the α's difficult. A
number of papers deal with this question [Muzy et al.(1994), Jaffard and
Meyer(1996), Mallat and Hwang(1992), Vehel and Vojak(1998)]. Though the Hölder
exponents do not characterize the local regularity properties of a signal
completely [Guiheneuf et al.(1998)], we are going to use the simple relation
(3) to show that even a rough estimation of local scaling characteristics of
the signal may enhance the performance of ANNs. We note that a running
numerical estimate of α may also fluctuate sharply for other nonstationary
processes which are not multifractal.
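
A rough running estimate of α along the lines of relations (1)-(3) might be
sketched in Python as follows. This is an illustration under stated
assumptions, not the code used for the results in this paper: the function
names are hypothetical, each window is taken to cover the W most recent samples
ending at t_i, and α is obtained as the least-squares slope of log E against
log W over the supplied window lengths. The series is assumed to be non-zero
and at least two distinct window lengths are required.

```python
import math


def normalized_energy(x):
    """Squared amplitudes scaled so that they sum to 1 (relation 2)."""
    total = sum(v * v for v in x)
    return [v * v / total for v in x]


def holder_exponents(x, windows):
    """Slope of log E(t_i, W) versus log W at every point t_i.

    windows : window lengths in samples (assumed distinct, at least two).
    Points with too little history or zero windowed energy yield None.
    """
    mu = normalized_energy(x)
    prefix = [0.0]                      # prefix sums of the measure
    for m in mu:
        prefix.append(prefix[-1] + m)
    log_w = [math.log(w) for w in windows]
    mean_lw = sum(log_w) / len(log_w)
    var_lw = sum((lw - mean_lw) ** 2 for lw in log_w)
    alphas = []
    for i in range(len(x)):
        if i + 1 < max(windows):
            alphas.append(None)
            continue
        # Energy in the last w samples ending at index i
        energies = [prefix[i + 1] - prefix[i + 1 - w] for w in windows]
        if min(energies) <= 0.0:
            alphas.append(None)
            continue
        log_e = [math.log(e) for e in energies]
        mean_le = sum(log_e) / len(log_e)
        cov = sum((lw - mean_lw) * (le - mean_le)
                  for lw, le in zip(log_w, log_e))
        alphas.append(cov / var_lw)
    return alphas
```

For a series of constant amplitude the measure is uniform and the estimate is
close to 1, whereas an isolated spike at t_i concentrates the energy in the
shortest window and drives the estimate towards 0.
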

2.2 ANN description

A layered backpropagation ANN model [Rumelhart et al.(1986), Kröse and
Smagt(1996)] with a feedback connection from the output layer to the input
layer was constructed. The output-input layer connection makes the output
history an ordinary input unit in the training process. The output of the model
can be expressed in the form

    y(t + \Delta\tau) = F\Big( \sum_{k=1}^{Q} w_k f_k\Big(
          \sum_{j=0}^{T} v_{jk} I^{(1)}(t - j\Delta\tau)
        + \sum_{j=0}^{T} v_{jk} I^{(2)}(t - j\Delta\tau)
        + \sum_{i=0}^{T} u_{ik} y(t - i\Delta\tau) + v_0 \Big) + w_0 \Big)    (4)

where y denotes the Dst time series; the two inputs equal I^{(1)} = Pc1 and
I^{(2)} = Pc2; T is the history; Δτ the time resolution (Δτ = 1 h); u_{ik},
v_{jk} the weights between the input and hidden layers; w_k the weights between
the hidden and output layers; v_0, w_0 the biases of the layers; Q the number
of hidden units; F and f_k the activation functions. In our model f_k are
hyperbolic tangent functions, F is the linear activation function and Q = Q.
The performance of the ANN model was evaluated through the root mean squared
error (RMSE) and the correlation coefficient (ρ)

    RMSE = \sqrt{ \frac{1}{N} \sum_{i=1}^{N} (y_i^{out} - y_i^{pred})^2 }    (5)

    \rho = \frac{ \frac{1}{N} \sum_{i=1}^{N} (y_i^{out} - \bar{y}^{out})
           (y_i^{pred} - \bar{y}^{pred}) }{ \sigma_{y^{out}} \sigma_{y^{pred}} }    (6)

where y^{out} denotes an actual output, \bar{y}^{out} its mean value and
y^{pred} a one-step ahead prediction of the ANN, \bar{y}^{pred} its mean value;
N is their length; σ_{y^{out}} and σ_{y^{pred}} are the standard deviations of
y^{out} and y^{pred}.
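
For orientation, the forward pass of equation (4) and the two scores (5) and
(6) can be written as a short Python sketch. The function names are
hypothetical, and the code assumes a separate set of input-to-hidden weights
for each of the two input series and for the feedback, with population (1/N)
standard deviations; it is not the implementation used in this study and it
omits the training step.

```python
import math


def ann_output(pc1, pc2, y_hist, V1, V2, U, w, v0, w0):
    """One-step-ahead output with tanh hidden units and a linear output.

    pc1, pc2, y_hist : lagged values [x(t), x(t - dtau), ...]
    V1, V2, U        : one row of weights per hidden unit (assumed layout)
    w                : hidden-to-output weights, one per hidden unit
    v0, w0           : hidden-layer and output-layer biases
    """
    total = w0
    for k in range(len(w)):
        s = v0
        s += sum(a * b for a, b in zip(V1[k], pc1))
        s += sum(a * b for a, b in zip(V2[k], pc2))
        s += sum(a * b for a, b in zip(U[k], y_hist))
        total += w[k] * math.tanh(s)
    return total  # F is linear, so no further transformation is applied


def rmse(y_out, y_pred):
    """Root mean squared error between actual and predicted series."""
    n = len(y_out)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y_out, y_pred)) / n)


def correlation(y_out, y_pred):
    """Linear correlation coefficient between actual and predicted series."""
    n = len(y_out)
    m_out = sum(y_out) / n
    m_pred = sum(y_pred) / n
    cov = sum((a - m_out) * (b - m_pred)
              for a, b in zip(y_out, y_pred)) / n
    s_out = math.sqrt(sum((a - m_out) ** 2 for a in y_out) / n)
    s_pred = math.sqrt(sum((b - m_pred) ** 2 for b in y_pred) / n)
    return cov / (s_out * s_pred)
```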



3 Data analysis 

In this paper we predict the Dst index one hour in advance using the layered
backpropagation ANN model with feedback connection. Prior to that, we show
several examples which demonstrate that the Hölder exponents estimated by
Equation 3 provide local characteristics of the analysed time series that are
sensitive enough to capture the necessary information on abrupt changes and
activity levels.

Figure 1a shows interplanetary magnetic field (IMF) variations registered by
the ACE satellite, which continuously monitors the SW at the L1 Earth-Sun
Lagrange point. The time resolution is 16 s and 5 hours of data are shown,
starting from January 14, 1998, 05:20 UT. This is a time period of very low
activity level, with a mean value of the IMF ACE B fluctuations of 3 nT. The
Hölder exponents estimated at each point within a variable window length
W ∈ (16, 16 × 160) s are depicted in Figure 1b. It is visible that α fluctuates
around its mean value ᾱ ~ 1, which means that the measure is almost uniformly
distributed. The energy content of the signal E and its scaling with window
length, that is E ~ W^α, is shown in a log-log plot in Figure 1c.

In contrast with Figure 1, Figure 2 shows a more disturbed period of IMF ACE B
variations on March 31, 2001 from 00:00 to 05:00 UT. The mean value of B is
43 nT. Large departures from α = 1 are present (Figure 2b), mainly within time
periods of enhanced fluctuations. These periods are characterized by a sudden
increase of regularity (α > ᾱ) followed by periods of low regularity (α < ᾱ) or
vice versa.

In fact, α appears to be a sensitive indicator of fluctuations which may occur
during periods of enhanced IMF B amplitudes. However, when the fluctuations
cease, the values of α return to ᾱ ~ 1, independently of the actual amplitudes.
A good example is visible within the time interval t ∈ (2600, 5000) s in
Figures 2a, b, where B > 50 nT and α ~ 1. Moreover, the local fluctuations of α
around ᾱ seem to be larger when the gradient of B increases, but this is not
always valid (not shown). There is also a clear difference between the scalings
in Figures 1c and 2c.

We conclude that, besides the amplitude of magnetic field variations, the local
scaling properties of the signal described by the Hölder exponents α
(Equation 3) may represent an essential piece of information, the consideration
of which would allow a better prediction of future geomagnetic activity.

Other examples of longer period data sets (from March 19 to April 25, 2001)
are depicted in Figure 3. This time, IMF B_z from the ACE satellite and the Dst
index are considered with a time resolution of 1 hour. The thick line in
Figure 3a corresponding to B_z = -10 nT highlights periods of enhanced SWMC.

[Gonzalez and Tsurutani(1987)] have shown that the interplanetary causes of
intense magnetic storms (Dst < -100 nT) are long duration (> 3 h), large and
negative (< -10 nT) B_z events associated with interplanetary duskward electric
fields > 5 mV m^-1. Comparison of Figures 3a, d shows an agreement with the
above criteria, that is, long duration negative IMF B_z events occur together
with intense magnetic storms. The horizontal thick line in Figure 3d
corresponds to the limit of Dst = -100 nT. Figure 3b shows the normalized
measure E and the estimated Hölder exponents are in Figure 3c. Approximately
the same behaviour is visible as previously (Figure 2), which may be even
better visualised by drawing 3D plots of time, IMF B_z or Dst index and the
corresponding Hölder exponents as in Figures 4a, b. In both cases, when the
above mentioned physical limits of amplitudes (B_z < -10 nT and Dst < -100 nT)
are crossed, the Hölder exponents have their local minima, α < ᾱ, indicating
sharp irregular variations. Intense magnetic storms (Dst < -100 nT and α < ᾱ)
are usually preceded by sudden increases of α ≫ ᾱ, that is, by short periods of
increased regularity (Figure 4b). The same effect is present in the IMF B_z
time series (Figure 4a), though, except for the large event around ~ 300 hours,
it is less visible.
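
The storm criteria quoted above lend themselves to a simple check on hourly
data. The following Python sketch is only an illustration: the function names
are hypothetical, "long duration (> 3 h)" is interpreted as more than three
consecutive hourly samples below the threshold, and the electric field is
approximated as the product of solar wind speed and southward field.

```python
def long_southward_intervals(bz, threshold=-10.0, min_hours=3):
    """Index pairs (start, end), end exclusive, of runs in an hourly B_z
    series [nT] that stay below `threshold` for more than `min_hours`
    consecutive samples."""
    runs, start = [], None
    for i, value in enumerate(bz):
        if value < threshold:
            if start is None:
                start = i
        elif start is not None:
            if i - start > min_hours:
                runs.append((start, i))
            start = None
    # A run may still be open at the end of the series
    if start is not None and len(bz) - start > min_hours:
        runs.append((start, len(bz)))
    return runs


def duskward_field_mV_per_m(v_km_s, bs_nT):
    """E = v * B_s; (km/s) * nT = 1e3 m/s * 1e-9 T = 1e-6 V/m = 1e-3 mV/m."""
    return v_km_s * bs_nT * 1e-3
```

As a worked example with illustrative numbers, a solar wind speed of 500 km/s
and a southward field of 10 nT give an electric field of about 5 mV m^-1, which
is the threshold quoted above.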

We expect that precisely the interplay between regularity / irregularity and
amplitude characteristics should be learnt by ANNs to achieve superior
performance. The simplest way to realize that is to add, besides the amplitudes
of the analysed variables, the corresponding series of Hölder exponents to the
ANN input. The following ACE SW parameters with Δt = 1 hour time resolution
were used: B_x, B_y, B_z, |B|, n, v. The time evolution of the 1 hour Dst index
from January 1 to July 28, 2001 was considered. The time series of SW
parameters were preprocessed using principal component (Pc) analysis
[Gnanadesikan(1977), Reyment and Jöreskog(1996)]. The linear combinations of
normalized SW parameters, their derivatives and combinations, B_x, B_y, B_z,
|B|, n, v, nv, n|B|, v|B|, vn|B|, dB_x/dt, dB_y/dt, dB_z/dt, d|B|/dt, dv/dt,
dn/dt, were used for the calculation of the Pc's. It was shown by [Jankovičová
et al.(2002)] that, for the considered set of SW parameters, most of the
variance of SW fluctuations is controlled by the first two components. In this
paper we use Pc1 and Pc2 as SW input time series.
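
The preprocessing step can be pictured with the following NumPy sketch. It is
an assumption-laden illustration rather than the procedure actually used: the
function names are hypothetical, |B| is computed here from the three components
(in practice it is a separately measured parameter), derivatives are taken by
finite differences, and the principal components are obtained from the
correlation matrix of the normalized columns, none of which may be constant.

```python
import numpy as np


def sw_feature_matrix(bx, by, bz, n, v, dt=1.0):
    """Stack the parameters, products and time derivatives listed above
    into a (samples x 16) array."""
    bx, by, bz, n, v = (np.asarray(a, dtype=float)
                        for a in (bx, by, bz, n, v))
    b = np.sqrt(bx**2 + by**2 + bz**2)   # assumption: |B| from components
    base = [bx, by, bz, b, n, v]
    products = [n * v, n * b, v * b, v * n * b]
    derivs = [np.gradient(a, dt) for a in base]
    return np.column_stack(base + products + derivs)


def principal_components(features, n_components=2):
    """Scores of the leading principal components and the fraction of
    variance each one explains."""
    X = np.asarray(features, dtype=float)
    Z = (X - X.mean(axis=0)) / X.std(axis=0)   # zero mean, unit variance
    corr = (Z.T @ Z) / Z.shape[0]
    eigval, eigvec = np.linalg.eigh(corr)      # eigenvalues in ascending order
    order = np.argsort(eigval)[::-1][:n_components]
    scores = Z @ eigvec[:, order]
    return scores, eigval[order] / eigval.sum()
```

The first two columns of `scores` would then play the role of Pc1 and Pc2.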

The local scaling characteristics of the principal components are described in
the same way as those of the other SW parameters. The time interval under study
was divided into two subsets. The first one (part A in Figure 5), from
January 1 to March 14, 2001, was used for ANN training, while the second one
(part B in Figure 5), from March 15 to July 28, 2001, represented an
independent set for prediction, not included in the ANN training process. The
influence of the inclusion of local Hölder exponents on ANN performance was
tested for a set of values of history T and window length W, with T = W. In all
cases analysed here a feedback consisting of the past T values of the Dst index
was set. Figure 6 shows the dependence of the correlation coefficient ρ
(Equation 6) in three different cases: 1.) Hölder exponents α are not
considered on input at all - only Pc1, Pc2 and the Dst feedback with history T
(indicated by a continuous line); 2.) Hölder exponents of the Pc1 and Pc2
vectors are added as input (marked by "*"); 3.) as in case 2.), but Hölder
exponents describing the local scaling properties of past Dst values are also
added as an extra input (depicted by "o"). The effect of the inclusion of
Hölder exponents is evident mainly in the superior performance of ANNs in
case 3. The correlation coefficient ρ achieves its maximum ρ_max = 0.99 at
W = T = 2 h and decreases with increasing T and W. At the same time ANN
performance is practically unchanged in cases 1 and 2 when T and W increase. We
mention that without the Dst feedback ρ slowly increases with T [Jankovičová et
al.(2002)]. As can be seen, the consideration of scaling properties of Pc1 and
Pc2 SW data slightly enhances the performance level of the ANN, but a real
improvement is achieved when the singularity or regularity properties of
geomagnetic fluctuations are taken into account, too (case 3). This seems to
confirm our expectation that the information on local scaling properties of
signals put to the input layer allows the ANN to learn input-output relations
better, accounting for changing activity levels more effectively. The analysis
of RMSE (Equation 5) leads to the same conclusion. For demonstration, 1 hour
ahead predictions of an intense geomagnetic storm are shown in Figure 7a. Two
methods are compared (Figure 7b, c): case 1 as defined above, when the Dst
index is predicted without Hölder exponents, and case 3, with the information
on α's (Pc1, Pc2 and Dst) added to the input layer (cases 1 and 2 are similar).
It is easy to recognize that the method using α's (case 3) allows almost all
the variance in the data to be predicted, with ρ = 0.99 and RMSE = 2 nT for
T = W = 2 h. At the same time ρ = 0.93 and RMSE = 7 nT for T = W = 2 h in the
case without Hölder exponents (Figure 6). In comparison, [Wu and
Lundstedt(1996)] have exploited Elman recurrent ANNs to predict the Dst index
1 hour ahead only from SW data. They achieved ρ = 0.91 and RMSE = 16 nT.
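
The three input configurations compared above can be made concrete with a short
Python sketch. It is hypothetical in its details: the function names and the
`alpha` dictionary are illustrative, the history is taken as the T most recent
samples of each series, and the Hölder exponent series are assumed to be passed
with the same history as the amplitudes.

```python
def lagged(series, t, T):
    """Values series[t], series[t-1], ..., series[t-T+1]."""
    return [series[t - j] for j in range(T)]


def build_inputs(pc1, pc2, dst, alpha=None, case=1, T=2):
    """Assemble (input vector, target) pairs for 1 hour ahead prediction.

    case 1: Pc1, Pc2 and Dst feedback
    case 2: case 1 plus Hölder exponents of Pc1 and Pc2
    case 3: case 2 plus Hölder exponents of past Dst
    alpha : dict with keys 'pc1', 'pc2', 'dst' (needed for cases 2 and 3)
    """
    X, y = [], []
    for t in range(T - 1, len(dst) - 1):
        row = lagged(pc1, t, T) + lagged(pc2, t, T) + lagged(dst, t, T)
        if case >= 2:
            row += lagged(alpha['pc1'], t, T) + lagged(alpha['pc2'], t, T)
        if case == 3:
            row += lagged(alpha['dst'], t, T)
        X.append(row)
        y.append(dst[t + 1])   # target: Dst one step ahead
    return X, y
```

Splitting the resulting pairs by date into a training part (A) and an
independent prediction part (B) then amounts to slicing `X` and `y` at the
index that corresponds to the boundary between the two periods.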



4 Conclusions 

We presented a prediction technique which uses the extra information on local 
scaling exponents to improve the performance of a layered ANN with feedback. 

It was demonstrated that the Hölder exponents α are time dependent and change
from point to point, exhibiting large deviations from the mean value ᾱ = 1,
mainly during enhanced activity levels of fluctuations. A peculiar interplay
between regularity / irregularity features (described by α) and amplitude
characteristics of disturbances was found and demonstrated on examples of SW
and geomagnetic data. ANN performance was significantly improved by putting the
Hölder exponent time series of the corresponding SW and past geomagnetic data
to the input layer, yielding the smallest RMSE of 2 nT for short history
T = 2 h and window length W = 2 h. The results obtained without Hölder
exponents were the worst (ρ ~ 0.93, RMSE ~ 7 nT). Only a small improvement, if
any, was achieved when only the Hölder exponents of SW Pc1 and Pc2 were added
(ρ ~ 0.94, RMSE ~ 6 nT). This means that, to understand and model the
magnetospheric response better, the scaling and irregularity / regularity
features of magnetospheric fluctuations should be taken into account in
addition to the SW input and geomagnetic history (feedback). This is not an
unexpected result, however, because recent nonlinear theories of SWMC or
magnetotail dynamics involve or predict the appearance of scalings,
irregularities (singularities) and turbulence [Galeev et al.(1986),
Chang(1999), Chapman et al.(1999), Klimas et al.(2000)]. To fully exploit this
approach on an experimental basis, further investigations of scalings and
singularity features of fluctuations in different inner and outer regions of
the magnetosphere will be necessary.

Figure 1: Period of low activity level; a. interplanetary magnetic field from
the ACE satellite (time resolution 16 s); b. the estimated time series of
Hölder exponents α; c. the energy content of the signal versus window length W.

Figure 2: Period of high activity level; a, b, c - same as in Figure 1.

Figure 3: a. Interplanetary magnetic field B_z component (time resolution
1 hour); b. the corresponding energy content E; c. the Hölder exponents;
d. geomagnetic Dst index.

Figure 4: The interplay between regularity / irregularity and amplitude
characteristics; a. interplanetary magnetic field B_z; b. geomagnetic Dst
index.

Figure 5: Dst time series from January 1, 2001 to July 28, 2001 used for ANN
analysis (A - the period for the training process; B - independent set for
prediction; the thick vertical line divides A and B).

Figure 6: Correlation coefficient dependence on history T or window length W
(T = W): continuous line - without Hölder exponents; * - Hölder exponents for
vectors of Pc1 and Pc2 inputs considered; o - Hölder exponents for inputs Pc1
and Pc2 and for past Dst index considered.

Figure 7: a. 1 hour ahead prediction of the Dst index for the period from
March 10, 2001 10:00 UT to March 13, 2001 08:00 UT (• - actual output; - -
prediction without Hölder exponents; o - prediction with Hölder exponents of
Pc1, Pc2 and Dst on input); b. differences between actual Dst and predicted Dst
without Hölder exponent time series; c. differences between actual Dst and
predicted Dst with Hölder exponent time series on input.